New Experiments in Distributional Representations of Synonymy

Authors

  • Dayne Freitag
  • Matthias Blume
  • John Byrnes
  • Edmond Chow
  • Sadik Kapadia
  • Richard Rohwer
  • Zhiqiang Wang
Abstract

Recent work on the problem of detecting synonymy through corpus analysis has used the Test of English as a Foreign Language (TOEFL) as a benchmark. However, this test involves as few as 80 questions, prompting questions regarding the statistical significance of reported results. We overcome this limitation by generating a TOEFL-like test using WordNet, containing thousands of questions and composed only of words occurring with sufficient corpus frequency to support sound distributional comparisons. Experiments with this test lead us to a similarity measure which significantly outperforms the best proposed to date. Analysis suggests that a strength of this measure is its relative robustness against polysemy.
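
The concern about an 80-question test can be made concrete with a rough calculation. The sketch below is not taken from the paper: it uses a plain normal-approximation confidence interval for a proportion, and both the 0.80 accuracy and the 5000-question test size are hypothetical values chosen only for illustration (the abstract says only "thousands of questions").

```python
import math


def accuracy_margin(accuracy: float, n_questions: int, z: float = 1.96) -> float:
    """Half-width of a normal-approximation confidence interval for an
    accuracy measured on n_questions independent multiple-choice items.
    z = 1.96 corresponds to a two-sided 95% interval."""
    return z * math.sqrt(accuracy * (1.0 - accuracy) / n_questions)


# Hypothetical values: a system scoring 80% on each test.
small_test_margin = accuracy_margin(0.80, 80)     # TOEFL-sized test
large_test_margin = accuracy_margin(0.80, 5000)   # assumed size of a generated test

print(f"80 questions:   +/- {small_test_margin:.3f}")
print(f"5000 questions: +/- {large_test_margin:.3f}")
```

Under these assumptions the margin on 80 questions is roughly 9 percentage points, so two systems a few points apart cannot be reliably separated, whereas on a test of several thousand questions the margin shrinks to about 1 point.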

Similar resources

A Revision of the Euphorbia Dioscoreoides Complex (Euphorbiaceae)

A revision of the Euphorbia dioscoreoides complex (subgenus Agaloma) is provided. Euphorbia dioscoreoides ssp. attenuata and E. eglandulosa, both from Mexico, are proposed as new; E. digitata is reduced to synonymy under E. subpeltata. Representative specimens are cited, and distributional and ecological data are provided.

A checklist of stag beetles (Coleoptera: Scarabaeoidea: Lucanidae) from Iran.

An updated checklist of the Lucanidae (Coleoptera) from Iran is given. New locality records are listed and some dubious distributional records are discussed. Dorcus vavrai Nonfried, 1905 is placed in synonymy with Dorcus peyronis Reiche and Saulcy, 1856 (new synonymy). The female of Lucanus xerxes Král, 2004 is described. A key for the identification of the Iranian stag beetle species is also pr...

A Computational Holographic Model of Memory for Abstract Associations

How do humans learn the syntax and semantics of words from language experience? How does the mind discover abstract relationships between concepts? Computational models of distributional semantics can analyze a corpus to derive representations of word meanings in terms of each word’s relationship to all other words in the corpus. While these models a...

Bridging the distributional gap of Tylorida striata (Thorell, 1877) and new synonymy (Araneae: Tetragnathidae)

BACKGROUND Although Tylorida striata has not been reported from India, observations on India Biodiversity Portal (IBP 2015), an open access repository for biodiversity information of the Indian subcontinent, showed images resembling this species. The respective locality in Gujarat, India was explored and specimens were studied to confirm the record of T. striata in India. Literature study showed some tax...

New species and records of Charisius Champion from Mexico and Central America (Coleoptera, Tenebrionidae, Alleculinae)

The species of the genus Charisius Champion, from Mexico and Central America are reviewed. The flightless genus Narses Champion, with one included species, N. subalatus Champion, is placed in synonymy with the genus Charisius. Four new species are described and illustrated, C. granulatus and C. punctatus (from Guatemala) and C. apterus and C. howdenorum (from Mexico). Charisius subalatus (Champ...

Publication date: 2005